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Abstract. Models for generating simple graphs are important in the 
study of real-world complex networks. A well established example of 
such a model is the erased configuration model, where each node re¬ 
ceives a number of half-edges that are connected to half-edges of other 
nodes at random, and then self-loops are removed and multiple edges are 
concatenated to make the graph simple. Although asymptotic results for 
many properties of this model, such as the limiting degree distribution, 
are known, the exact speed of convergence in terms of the graph sizes re¬ 
mains an open question. We provide a first answer by analyzing the size 
dependence of the average number of removed edges in the erased con¬ 
figuration model. By combining known upper bounds with a Tauberian 
Theorem we obtain upper bounds for the number of removed edges, in 
terms of the size of the graph. Remarkably, when the degree distribution 
follows a power-law, we observe three scaling regimes, depending on the 
power law exponent. Our results provide a strong theoretical basis for 
evaluating finite-size effects in networks. 


1 Introduction 

The use of complex networks to model large systems has proven to be a powerful 
tool in recent years. Mathematical and empirical analysis of structural properties 
of such networks, such as graph distances, centralities, and degree-degree corre¬ 
lations, have received vast attention in recent literature. A common approach for 
understanding these properties on real-world networks, is to compare them to 
those of other networks which have the same basic characteristics as the network 
under consideration, for instance the distribution of the degrees. Such test net¬ 
works are usually created using random graph models. An important property 
of real-world networks is that they are usually simple, i.e. there is at most one 
edge between any two nodes and there are no self-loops. Hence, random graph 
models that produce simple graphs are of primary interest from the application 
point of view. 

One well established model for generating a graph with given degree distri¬ 
bution is the configuration model [SI [HUH], which has been studied extensively 
in the literature [iiiaiisiiii]. In this model, each node first receives a certain 
number of half-edges, or stubs, and then the stubs are connected to each other 


at random. Obviously, multiple edges and self-loops may appear during the ran¬ 
dom wiring process. It is well-known that when the degree distribution has finite 
variance, the graph will be simple with positive probability, so a simple graph 
can be obtained by repeatedly applying the model until the resulting graph is 
simple. However, when the variance of the degrees is infinite the resulting graph 
will, with high probability, not be simple. To remedy this, one can remove self¬ 
loops and concatenate the multiple edges to make the graph simple. This version 
is know as the erased configuration model. Although removal of edges impacts 
the degree distribution, it has been shown that asymptotically the degree dis¬ 
tribution is unchanged. For a thorough systematic treatment of these results we 
refer the reader to |12) . 

An important feature of the configuration model is that, conditioned on the 
graph being simple, it samples a graph uniformly from among all simple graphs 
with the specified degree distribution. This, in combination with the neutral 
wiring in the configuration model, makes it a crucial model for studying the 
effects of degree distributions on the structural properties of the networks, such 
as, for instance, graph distances [iniiiiiiiiio] and epidemic spread [niiiiiii]. 

We note that there are several different methods for generating simple graphs, 
sampled uniformly from the set of all simple graphs with a given degree sequence. 
A large class of such models use Markov-Chain Monte Carlo methods for sam¬ 
pling graphs uniformly from among all graphs with a given set of constraints, 
such as the degree sequence. These algorithms use so-called edge swap or switch¬ 
ing steps, [niiHiiia, each time a pair of edges is sampled and swapped, if this 
is allowed. The main problem with this method are the limited theoretical re¬ 
sults on the mixing times, in [7] mixing times are analyzed, but only for regular 
graphs. Other methods are, for instance, the sequential algorithms proposed in 
Hi which have complexity 0{EN‘^) and 0{EN), respectively, where N is the 
size of the graph and E denotes the number of edges. The erased configura¬ 
tion model however,is well studied, with strong theoretical results and is easy to 
implement. 

In a recent study [22) . authors compare several methods, including the above 
mentioned Markov-Chain Monte Carlo methods, for creating test graphs for the 
analysis of structural properties of networks. The authors found that the number 
of removed edges did not impact the degree sequence in any significant way. 
However, several other measures on the graph, for instance average neighbor 
degree, did seem to be altered by the removal of self-loops and double-edges. 
This emphasizes the fact that asymptotic results alone are not sufficient. The 
analysis of networks requires a more detailed understanding of finite-size effects 
in simple random graphs. In particular, it is important to obtain a more precise 
characterization for dependence of the number of erased edges on the graph size, 
and their impact on other characteristics of the graph. 

In our recent work m we analyzed the average number of removed edges 
in order to evaluate the degree-degree correlations in the directed version of 
the erased configuration model. We used insights obtained from several limit 
theorems to derive the scaling in terms of the graph size. Here we make these 


rigorous by proving three upper bounds for the average number of removed 
edges in the undirected erased configuration model with regularly varying degree 
distribution. We start in Section 2 with the formal description of the model. Our 
main result is stated in Section 3 and the proofs are provided in Section 4. 


2 Erased Configuration Model 

The Erased Configuration Model (ECM) is an alteration of the Configuration 
Model (CM), which is a random graph model for generating graphs of size n with 
either prescribed degree sequence or degree distribution. Given a degree sequence 
D„ such that even, the degrees of each node are represented as stubs 

and the graph is constructed by randomly pairing stubs to form edges. This will 
create a graph with the given degree sequence. 

In another version of the model, degrees are sampled independently from a 
given distribution, an additional stub is added to the last node if the sum of 
degrees is odd, and the stubs are connected as in the case with given degrees. 
The empirical degree distribution of the resulting graph will then converge to 
the distribution from which the degrees were sampled as the graph size goes to 
infinity, see for instance | 12 ) . 

When the degree distribution has finite variance, the probability of creating 
a simple graph with the CM is bounded away from zero. Hence, by repeating the 
model, one will obtain a simple graph after a finite number of attempts. This 
construction is called the Repeated Configuration Model (RCM). It has been 
shown that the RCM samples graphs uniformly from among all simple graphs 
with the given degree distribution, see Proposition 7.13 in |12) . 

When the degrees have infinite variance the probability of generating a simple 
graph with the CM converges to zero as the graph size increases. In this case the 
ECM can be used, where after all stubs are paired, multiple edges are merged 
and self-loops are removed. This model is computationally far less expensive than 
the RCM since the pairing only needs to be done once while in the other case the 
number of attempts increases as the variance of the degree distribution grows. 
The trade-off is that the ECM removes edges, altering the degree sequence and 
hence the empirical degree distribution. Nevertheless it was proven, see [12], that 
the empirical degree distribution for the ECM still converges to the original one 
as n —>■ oo. 

For our analysis we shall consider graphs of size n generated by the ECM, 
where the degrees are sampled at random from a regularly varying distribution. 
We recall that X is said to have a regularly varying distribution with finite mean 
if 

F {X > k) = L{k)k~'^ with 7 > I, (1) 

where L is a slowly varying function, i.e. \im.x^oa L{tx)/L{x) = I for all t > 0. 
The parameter 7 is called the exponent of the distribution. 

For n S N we consider the degree sequence D„ as a sequence of i.i.d. samples 
from distribution o, let /X = E [D] and denote by L„ = f^e sum of the 

degrees. Formally we need to be even in order to have a graphical sequence. 


in which case Ln/2 is the number of edges. This can be achieved by increasing 
the degree of the last node by one if the sum is odd. This alteration adds a 
term uniformly bounded by one which does not influence the analysis. Therefore 
we can omit this and treat the degree sequence D„ as an i.i.d. sequence. 

For the analysis we denote by Eij the number of edges between two nodes, 
1 < bj < n, created by the CM and by E^- the number of edges between the 
two nodes that where removed by the ECM. Furthermore, we let P„ and E„ be, 
respectively, the probability and expectation conditioned on the degree sequence 


3 Main result 

The main result of this paper is concerned with the scaling of the average number 
of erased edges in the ECM. It was proven in m that 

^ ^ E„ [Efj] ^0 as rn> oo, (2) 

J-'n . ■ 


P 

where denotes convergence in probability. This result states that the average 
number of removed edges converges to zero as the graph size grows, which is in 
agreement with the convergence in probability of the empirical degree distribu¬ 
tion to the original one. However, until now there have not been any results on 
the speed of convergence in ([5]). In this section we will state our result, which 
establishes upper bound on the scaling of the average number of erased edges. 

To make our statement rigorous we first need to define what we mean by 
scaling for a random variable. 

Definition 1. Let (X„)„gN be sequences of random variables and let p S R. 
Then we define 

Xn = Op (n'’) for all e > 0 n ^ 0 , as n —>■ oo. 

We are now ready to state the main result on the scaling of the average 
number of erased edges in the ECM 

Theorem 1. Let Gn he a graph generated by the ECM with degree distribution 
o, let Ln be the sum of the degrees and denote by Ef^ the number of removed 
edges from i to j. Then 

^ fOp(n^"^^ */l<7<i, 

Op(n4-3) */|<7<2, (3) 

^ iop(n“^) i/7>2. 

The proof of Theorem[T]is given in the next section. The strategy of the proof 
is to establish two upper bounds for Y^^j=i [^fj] I for the case 1 < 7 < 2 , 


each of which scales as one of the first two terms from Then it remains to 
observe that the term dominates when 1 < 7 < 3/2 while the 

latter one dominates when 3/2 < 7 < 2. In addition, we prove the n~^ scaling 
for 7 > 2 . 

Theorem[T]gives several insights into the behavior of the ECM. First, consider 
the case when the degrees have finite variance (7 > 2). Equation Q tells us that 
in that case the ECM will erase only a finite, in n, number of edges. For large n, 
this alters the degree sequence in a negligible way. We then gain the advantage 
that we need to perform the random wiring only once. In contrast, the RCM 
requires multiple attempts before a simple graph is produced. This will be a 
problem, especially as 7 approaches 2 . 

An even more interesting phenomenon established by Theorem [T] is the re¬ 
markable change in the scaling at 7 = 3/2. Figure [T] shows the exponent in the 
scaling term in ([3|) as a function of 7 . Notice that for small values of 7 , the frac- 
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Fig. 1. The scaling exponent of the average number of erased edges, as a function of 

7- 


tion of erased edges decreases quite slowly with n. For example, when 7 = 1.1 
and n = 10® then « 284803. Hence, a significant fraction of edges will 
be removed, so we can expect notable finite size effects even in large networks. 
However, when 7 > 1.5 the finite size effects are already very small and decrease 
more rapidly with 7 . 

It will be seen from the proofs in the next section that the upper bounds for 
7 > 3/2 in Theorem [T] follow readily from the literature. Our main contribution 
is in the upper bound for 1 < 7 < 3/2, which corresponds to many real-world 
networks. The proof uses a Central Limit Theorem and a Tauberian Theorem 
for regularly varying random variables. Note that when 1 < 7 < 3/2 the upper 
bound is not at all tight and even increasing in n for 7 < 4/3. 







4 Upper bounds for erased edges 


Throughout this section we will use the Central Limit Theorem for regularly 
varying random variables also called the Stable Law CLT, see [IJ Theorem 

4.5.1. We summarize it below, letting A denote convergence in distribution, in 
the setting of non-negative regularly varying random variables. 

Theorem 2 (Stable Law CLT [24]). Let {Xi : i > 1} be an i.i.d. sequence of 
non-negative random variables with distribution o and 0 < 7 < 2. Then there 
exists a slowly varying function Lq, different from L, such that 

Yh=i Di - m„ 


Lo{n)r 


where S-y is a stable random variable and 
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[nE[U 


Sy, 


*/ 0 < 7 < 1 
*/7 = 1 

i/ 1 < 7 < 2 . 


From Theorem!^ we can infer several scaling results using the following ob¬ 
servation: By Slutsky’s Theorem it follows that 


ELi ^ 


Lo{n)n-> 

implies that for any e > 0 , 


Sy as n 


0 as n —)• oo. 


Lo{n)ni 

Hence \Ya=i U - = Op {Lo{n)n^ and therefore, by Potter’s Theorem, 

it follows that Er=i “ ^"1 = Finally, we remark that if D has 

distribution m with 1 < 7 < 2 , then has distribution m with exponent 
1/2 < 7/2 < 1. Summarizing, we have the following. 

Corollary 1. Let Gn be a graph generated by the ECM with degree distribution 
© and 1 < 7 < 2, then 


Ln = Op (n ), 




/in 


n 

Op and O/ = Op {n ~) 


i=l 


The third equation also holds for 7 = 2 since 
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= \ y,Dl - L^{n)n^V. 

i=l \i=l 
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= Op {Lo{n)n) np = Op (n). 



















4.1 The upper bounds Op (ji-i and Op (n 

For the proof of the upper bounds we will use the following proposition. 

Proposition 1 (Proposition 7.10 [12) 1. Let Gn he a graph generated by the 
CM and denote by Sn and Mn, respectively, the number of self-loops and multiple 
edges. Then 

^ n 2 / " jj2 \ ^ 

and E„[M„]<2 . 


2=1 


^ 2=1 


Lemma 1. Let Gn be a graph generated by the ECM with degree distribution 
(dJ; then 




i,j=i 


Or {n if 1 > 2. 


(4) 


Proof. We start by observing that 


'y ^ Eij — Sn + Mn, 
ij'=l 

and hence it follows from Proposition [T] that 

n " n2 / " n2^ 

EEdBjlsETr + dE-d 


2=1 


>.2 = 1 


First suppose that 1 < 7 < 2. Then, by Corollary [T] and the continuous 
mapping theorem it follows that 

I- E Kl s u E 7 aM =0, (ni-»), 


jj'=l 


Now suppose that 7 > 2. Then has finite mean, say ir, and therefore, by 
Theorem [2J 


1 " 




1 

Is 


Ea^- 


nu 


2=1 


+ ^ = Op(n? ^ + n = Op (n 


where the last step follows since 2/7 — 2 < —1 when 7 > 2. Since this is the 
main term the result follows. □ 


Lemma [T] provides the last two upper bounds from Theorem [T] However, as 
we mentioned before, the bound Op is not tight over the whole range 

1 < 7 < 2 since for 1 < 7 < 4/3 we have 4/7 — 3 > 0, and hence the upper 
bound diverges as n —>■ 00 which is in disagreement with Q. Therefore, there 
must exist another upper bound on the average erased number of edges, which 
goes to zero as n —>■ 00 for all 7 > 1. This new bound does not follow readily from 
the literature. Below we establish such upper bound and explain the essential 
new ingredients needed for its proof. 




4.2 


The upper bound Op 



We first observe that the number of erased edges between nodes i and j equals 
the total number of edges between the nodes minus one, if there is more than 
one edge. This gives, 


1 ” 

= -j— ^ \Eij — l|£;.^.>o}] 

^ n 1 ^ 

= -j— ^ [Eij] — — ^ E„ [l — 

. . _ -^75 


1 


ij'=l 


= 1 -+ -r E = 0 ). 


(5) 


jj'=l 


We can get an upper bound for P„ {Eij = 0) from the analysis performed 
in m, Section 4. Since the probability of no edges between i and j equals the 
probability that none of the Di stubs connects to one of the Dj stubs, it follows 
from equation (4.9) in [13] that 


Pra {Eij 


Di-1 

o)< n 


A;=0 



Dj 

- 2A - 1 


D^Dj 

(L„-2A)2' 


( 6 ) 


The product term in (jbj) can be upper bounded by exp{—DiDj/En}. For the 
second term we use that 


J_ V 

Ln {Ln - 2D,f 

zj = l 


£2 




< = Of 


i=l 



Putting everything together we obtain 


1 

-j— E] 

-^n . . T 


0 ) < ^ exp 
*j'=i 



(7) 


We will use d?]) to upper bound ([S|). In order to obtain the desired result 
we will employ a Tauberian Theorem for regularly varying random variables, 
which we summarize first. We write a ^ b to denote that a/b goes to one in a 
corresponding limit. 







Theorem 3 (Tauberian Theorem, [3|). Let X be a non-negative random 
variable with only finite mean. Then, for 1 < 7 < 2, the following are equivalent, 


i) P {X > t) L{t)t as t 


a) 


E[X] 


- 1 + exp < - 


X 
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00 , 
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as t 


00 . 


We will first explain the idea behind the proof of the Op bound. If 

we insert © into we get 


— ^ E„ [Ef^] < 1 - ^ ^ exp i - 




D+Di 


ij=i 


+ Op (^nr . (8) 


The terms on the right side can be rewritten to obtian an expression that resem¬ 
bles an empirical version of the left hand side of part ii) from Theorem [3l with 
t = Ln and X = O 1 O 2 . Thus, the scaling of the average number of erased edges 
will be determined by the scaling that follows from the Tauberian Theorem and 
the Stable Law CLT. 


Proposition 2. Let Gn be a graph generated by the ECM with degree distribu¬ 
tion o and 1 < 7 < 2. Then 
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Ln 




E, 


[%] = Op 


(9) 


Proof. We start with equation ([ 8 ]). Since the correction term here is of lower 
order, by extracting a factor n^/L„ from the other terms and using that Ln = 
suffices to show that 



We first consider the term inside the brackets in the left hand side of m- 
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Since 
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Ln 
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E! DiDj, 


it follows from Corollary[T]that both (fTTl) and (IT^ are Op . Next, observe 

that the function e““ — 1 + a; is positive which implies, by Markov’s inequality, 
that CSl) scales as its average 


E[0i02] 

/rn 


- 1 +E 




(14) 


where Di and D 2 are two independent random variables with distribution m 
and 1 < 7 < 2 , so that the product D 1 D 2 again has distribution (HD with the 
same exponent, see for instance the Corollary after Theorem 3 in [5]. Now we 
use Theorem [3] to find that da, and hence (IT^ are Op (n Finally, the term 
outside of the brackets in da is Op (n) and since 1 — 7 < i — 1 for all 7 > 1 , 
the result follows. □ 


5 Discussion 

The configuration model is one of the most important random graph models 
developed so far for constructing test graphs, used in the study of structural 
properties of, and processes on, real-world networks. The model is of course 
most true to reality when it produces a simple graph. Because this will happen 
with vanishing probability for most networks, since these have infinite degree 
variance, the ECM can be seen as the primary model for a neutrally wired 
simple graph with scale-free degrees. The fact that the fraction of erased edges is 
vanishing, suffices for obtaining asymptotic structural properties and asymptotic 
behavior of network processes in the ECM. However, real-world networks are 
finite, albeit very large. Therefore, it is important to understand and quantify 
how the properties and processes in a finite network are affected by the fact that 
the graph is simple. 

This paper presents the first step in this direction by providing probabilistic 
upper bounds for the number of the erased edges in the undirected ECM. This 
second order analysis shows that the average number of erased edges by the ECM 
decays as n~^ when the variance of the degrees is finite. Since the ECM is com¬ 
putationally less expensive then the RCM and other sequential algorithms, this 
is a strong argument for using the ECM as a standard model for generating test 
graphs with given degree distribution. Especially since, in contrast to Markov- 
Chain Monte Carlo methods using edge swap mechanics, it is theoretically well 
analyzed. We also uncover a new transition in the scaling of the average number 
of erased edges for regularly varying degree distributions with only finite mean, 
in terms of the exponent of the degree distribution. 

Based on the empirical results found by us in [16], we conjecture that the 
bounds we obtained are tight, up to some slowly varying functions. Therefore, 












as a next step one could try to prove Central Limit Theorems for the number 

of erased edges, using the bounds from Theorem [T] as the correct scaling factors. 

These tools would make it possible to perform statistical analysis of properties 

on networks, using the ECM as a model for generating test graphs. 
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